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The effectiveness of word stuffing for synchronization depends upon our 
ability to distinguish the stuff words from the data words at the destination 
and, thus, delete correctly the stuff words. If all input sequences are per- 
mitted, the data words must be encoded before stuffing occurs so that the 
stuff word can be distinct from the data words. In this paper, we give the 
code that, for a given redundancy , maximizes the minimum distance be- 
tween the stuff word and any data word. This helps to prevent the loss of 
character synchronization because of transitions between data and stuff 
words due to transmission errors. In contrast to the perfect distance sym- 
metries between the code words of the group codes normally encountered 
in error-control work, the primary virtue of the present code is its highly 
asymmetric distance structure. 

I. INTRODUCTION 

When a digital communication network is used for data transmission, 
it may be necessary to adjust transmission rates within the network to 
achieve synchronization. Word stuffing 1 '' is a technique that can be used 
for this purpose. The basic idea is to group the transmitted bits into 
words which we call data words. The data words are formed for trans- 
mission and are not related to any word structure that may exist in 
the customer's data stream. Stuff words, which are distinguishable 
from the data words, are inserted into the stream of data words at the 
transmitter. Thus, transmission rates can be adjusted within the net- 
work by inserting or deleting stuff words. At the destination, the stuff 
words are deleted whereas the data words are delivered to the customer. 
The effectiveness of word stuffing depends upon our ability to dis- 
tinguish the stuff words from the data words at the destination and, 
thus, delete correctly the stuff words. If the stuff words are incorrectly 
deleted, bits will be inserted into or deleted from the customer's data 
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stream. As a result, the received bits will be incorrectly formatted and 
incorrectly interpreted. When this occurs, we say that character syn- 
chronization is lost. Character synchronization is important in data 
transmission because once it is lost, subsequent bits are erroneously 
interpreted even if transmitted correctly. 

Two problems arise. First, we require that all input sequences are 
allowed. Thus, redundancy must be added to the data words by an 
encoder so that it is possible to choose a stuff word that is distinct from 
the data words. Second, when transmission errors occur, it is possible for: 

(i) a stuff word to be transformed into a data word, 
(ii) a data word to be transformed into a stuff word, or 
(Hi) a data word to be transformed into another data word. 

In most cases, (i) and (ii) are more serious than (Hi) because of the 
resulting loss of character synchronization. The prevention of type (Hi) 
errors is generally performed by the customer's terminal, when required, 
and is not considered in this paper. 

Previously, Mattesich and Richters 1 proposed a format for the data 
words and the stuff word. The format results in the stuff word being 
distance one* from a data word. Therefore, a single transmission error 
can change a data word into the stuff word or vice versa with a corre- 
sponding loss of character synchronization. 

We give an alternative encoding scheme that, for a given redundancy, 
maximizes the minimum distance between the stuff word and any data 
word. This helps to prevent the loss of character synchronization be- 
cause of transitions between data and stuff words due to transmission 
errors. An implementation is given for arbitrary word size and re- 
dundancy. For a redundancy of one bit, a particularly simple encoding- 
decoding technique is described. 

Some other methods of achieving synchronization by means of stuffing 
have been proposed. A word stuffing technique, proposed by Butman , 
achieves a distance d between the stuff word and any data word by 
inserting deliberate errors in certain data words at the transmitter to 
keep the data words at least distance d from the stuff word. Butman's 
technique requires some knowledge of the statistics of the transmitted 
data words for selection of the stuff word and a relatively large word size 
so that the deliberate insertion of errors is infrequent. It results in 
deliberate errors in the customer's data and prohibits the reception of 



♦The distance, frequently called the Hamming distance, between two binary 
words X and Y is the number of positions in which X and Y differ. 
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certain data words. The latter point is troublesome if the data words 
used in transmission are identical to the data words of the customer. 
Pulse stuffing, rather than word stuffing, can be used to achieve 
synchronization. Individual pulses are inserted at the transmitter and a 
separate data link is used to signal the locations of the stuff pulses. 
References 3 through 6 are representative of the work in pulse stuffing. 

II. PRELIMINARIES 

A model of the processing before transmission is shown in Fig. 1. 
The input binary data stream is segmented into k-bit data words de- 
noted by A, where 

A = (a k , a t _i , • • • , Oi). 

We assume that all of the 2* possible sequences for A are allowed. In 
order for the stuff word to be distinguishable from the data words, the 
alphabet is enlarged by adding redundancy to A. Thus, the encoder 
generates from A an ?i-tuple B where n > k and 

B = (b n , &„_, , • • • , &,). 

The sequence B is transmitted. The dimensions of such a code are 
denoted by (n, k). 

There are 2 n possible sequences for B of which 2* are used to transmit 
data. Thus, there are 2 n — 2*"' values of B that are not used for data but 
that can be used for other purposes including the stuff word. For n > k, 



2" — 2 > 2 



(1) 



The reader should appreciate the tremendous flexibility available in the 
design of the coding scheme because, by (1), never more than half of the 
possible B sequences are used as data words. 

The processing after transmission is indicated in Fig. 2. As shown 
later, it is possible to construct codes that have a minimum distance 
greater than one between the stuff word and any data word for practical 
values of n and k. For these codes, the first step at the destination is to 
delete the stuff word and all other received words "sufficientlv close" 
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Fig. 1 —Processing before transmission. 
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Fig. 2 — Processing after transmission. 

to the stuff word. In saying "sufficiently close," we mean that the re- 
ceived word is more likely the result of a stuff word corrupted by errors 
than a data word corrupted by errors. Let B' denote the w-tuples that 
are not deleted where 

B' = (K,M-i , ••• , &0. 

The decoder operates on B' to form the /c-bit word A' where 

A' = (a' k , oi_, , ••• , a[). 

The system functions properly if 

A' = A. 

At this point, we describe for reference the encoding and decoding 
schemes presented in Ref. 1. An (8, 7) code is used, that is, n = 8, 
k = 7. The encoder generates B from A by the relation 

b» = 1 

6,. = a,- for 1 £ t £ 7. 



Thus, 



B = (1, a 7 , a , • • • , a,). 



The stuff word is (00000001). At the receiver, only the word (00000001) 
is deleted as the stuff word; all other received words are decoded as 
data words. Thus, the stuff word is interpreted as a data word if one or 
more transmission errors occur. Alternatively, because the data word 
(10000001) is distance one from the stuff word, a single error can con- 
vert this data word into the stuff word. Any other data word requires 
at least two errors for conversion into the stuff word. 



III. DESCRIPTION OF THE CODE 

We construct a code that has one stuff word and 2* data words such 
that the minimum distance between the stuff word and any data word 
is maximized. The problem is to divide the 2" n-tuples into three sets; 
namely, 
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(i) the stuff word denoted by S, 
(ii) the set D consisting of the 2* data words, and 
(in) the set U consisting of the 2" — 2* — 1 unused words, 

such that, for given values of n and k, the minimum distance between S 
and any element of D is maximized. 

Choose one of the ??-tuples to be the stuff word S. For given n, k and S, 
let d m denote the maximum possible minimum distance between S and 
any element of D. We determine d m by observing that if all elements of 
D are to be at least distance d m from S, then U must contain all n- 
tuples that are distance d m — 1 or less from S. Thus, the set U contains 

the ( ] n-tuples distance 1 from S, 

the ( ) n-tuples distance 2 from *S, (2a) 



the ( , ) w-tuples distance d m — 1 from S, 

and 5 n-tuples distance greater than d m — 1 from S such that 

dm— 1 / \ d m I ' 

(n\ „ „„ „ t , . v 1 ^ In 



and 



g^ 2 "- 2 '-^^ W 



« = 2" -2'- 1- E ( n J- (2c) 

The data words are the remaining 2* words. 

The value of d m is determined by (2b). From (2b), it follows that d m 
is independent of S and is determined entirely by n and k. Also, it is 
easy to show that changing a code by adding* a constant rc-tuple to 
all words simply rotates the code with no change in the distance prop- 
erties, including d m . 

The code is specified by (2) up to the choice of the 8 unused words 
that are at distance greater than d m — 1 from S. While the choice of 
these 8 words does not alter d m , the probability that a data word is 
transposed into the stuff word is minimized if the 5 words are all chosen 



* The addition is component-by-component modulo two addition and is denoted 
by ffi. 
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to be distance d m from S. In practice, the choice of the 8 words does not 
change this probability substantially and it appears preferable to assign 
the 8 words to simplify encoder-decoder design. 

The case n = k + 1 is of interest because of the low redundancy. In 
Appendix A, it is shown that for n = k + 1, 



d m = 



+ 1 



where [x] denotes the largest integer less than or equal to x. Also, for 
n = k + 1 and k even, it is shown in Appendix A that 8 = 0. A plot of 
d m versus n for n = k -j- 1 and n = k + 2 is given in Fig. 3. 




Fig. 3 — Maximum minimum distance d m for various n. 

Transmission of the all-zero word can be avoided by choosing S so 
that the all-zero word is one of the unused words. However, it may be 
convenient, particularly when detecting stuff words, for S to be the all- 
zero word because then distance from S is equivalent to weight* and 
can be computed by counting ones. These two objectives can be simul- 
taneously satisfied as follows. Design the encoder and stuff word using 
a code C in which S is the all-zero word. Let S' be one of the unused 
words in C. Add S' to each word immediately before transmission and 

* The weight of a binary word X is the number of nonzero components in X 
and is denoted by w(X). 
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add S' to each word immediately after transmission. The double addi- 
tion of S' permits the suppression of the all-zero word for transmission 
but is transparent for the encoding, stuff word detection, and decoding 
operations. 

Because the data words and the stuff word form a subset of the set 
of all possible n-tuples, we may, on the average, transmit an unequal 
number of zeros and ones. By varying w(S) from to n, the relative 
number of zeros and ones in the data words can be varied from mostly 
ones to mostly zeros. 

IV. ENCODING, DECODING AND STUFF WORD DETECTION : GENERAL CASE 

In this section, we specify an encoder that achieves d m for arbitrary 
n and k. Let the stuff word be the all-one n-tuple. A necessary and 
sufficient condition for the encoder to achieve d m is that each A must 
be encoded into a unique B such that the maximum weight of any B is 
n — d m . 

We begin by regarding each A as the /c-bit natural binary representa- 
tion of some integer a, ^ a ^ 2* — 1. Thus, 

A = B k (a) 

where 

We can construct a table that, for each a, gives the corresponding A 
and B sequences. The entries are in the order of increasing a. For illus- 
tration, consider the (9, 7) code where d m = 6. The first 24 entries for 
the (9, 7) code are shown in Table 1. The A column corresponds to 
counting in binary from the all-zero /c-tuple to the all-one A;-tuple. The 
B column is also formed by counting in binary except that all n-tuples 
with weight greater than n — d m are omitted from the count. It follows 
that the maximum weight of any B is n — d m . The arrows in Table 1 
indicate where 9-tuples with weight greater than three have been 
omitted in the B column. 

Let us examine the counting in the B column of Table 1 in greater 
detail. Consider counting to the B sequence for a = 22, that is, 



(000011000). (3) 

TT 
position 5 — ■ — position 4 

First, count to (000010000). At this point, all 4-tuples of weight not 
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Table I — Relationship Between A and B for (9, 7) code; 
Only the First 24 Values of A are Shown. 
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000011000 


23 


0010111 
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greater than three have been used in positions one through four. Then, 
count to the sequence in (3) by using all 3-tuples of weight not greater 
than two in positions one through three while keeping a one in position 
five. As shown in Fig. 4, the value of a associated with the sequence 
in (3) has two components. The first component, denoted by <Xi(5)> is 
the number of B sequences used in counting to obtain the one in position 
five. Similarly, the second component, denoted by a 2 (4), is the number 
of B sequences used in counting to obtain the one in position four. 
Accordingly, 



where 



a = ai (5) + a 2 (4) 



«i(5) - E ;• - is, 



a 2 



(4) = L , = 7. 



The above ideas can be formalized so that, for an arbitrary B (de- 






WORD STUFFING CODE 



387 



000000000 



000001 1 10 
0000 10000 



0000101 10 
0000 1 1000 



-a, (51 



- "2(4) 



Fig. 4 — Counting to determine a. 

noted by B ), it is possible to determine the corresponding value of a 
(denoted by <x Q ). Let co = w{B Q ), co ^ n — d m . Let & , 2 , • • • , /3 a de- 
note the position numbers of the co nonzero components of B where 



01 > 02 > 



> /3„. 



For example, if B = (000011000), a = 2 and ft = 5, 2 = 4. Let a t (&) 
denote the contribution of the one in position 0,- to a > that is, 

<*o = I>,(0,). 
• -1 

Observe that a,(0.) is the number of sequences in the B column 
from the sequence whose nonzero components are in positions 
0i , 2 , • • • ,0,_i to the sequence whose nonzero components are in 
positions 0i , 2 , • • • , 0, . Thus, a,(0,) is the number of (0,- — l)-tuples 
of weight not greater than n — d m — i -f- 1 and is given by 

Min(n-dm-i' + l./3<-l> 



«.-(&) - 



It follows that 



«0 



= L 



Min(n-rfn-> + 1.0i-l) 



0. - 1 

m 



0.- - 1 
m 



(4) 



Equation (4) can be used to find B when a is given, that is, to design 
an encoder.* For each value of i, 1 ^ i ^ n — d„ , construct an array. 
In the ith array, list ; and a { (j) as j runs from one to n — i + 1. To 
encode a » find in the first array the largest ; (denoted by ;',) such that 

aiO'i) = a o • 



♦The ideas in this paragraph are illustrated by a numerical example in 
Appendix B. 
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Next, find in the second array the largest j (denoted by ;,) such that 

In the ith array, find the largest j such that 

a,0',) ^ « - £o»0'»). 

m-l 

The process continues until 

£ Om(jm) = «0 

m-l 

for some o>' ^ n — d m . It follows that B has ones in positions 
H ti% t "* * i« ' an d zeros in all remaining positions. Also, &>' = w, the 
weight of B . 

It is possible to obtain a recurrence relation for the elements of the 
arrays. It is shown in Appendix C that 

oii(j) + a i+l (j) = a<(j + 1) (5) 

for 1 ^ ; ^ n — i, 1 ^ i ^ n — d m — 1. By knowing that a;(l) = 1 for 
1 ^ i ^ n - d m and that a n - da (j) = jiorl £ j 2» d» + 1, equation (5) 
can be used to generate elements of the arrays. 

Also, equation (5) can be used to construct an encoder directly. 
First, we specify the subtraction and storage device shown in Fig. 5. 
Let R denote the integer stored in the device. The output is equal to the 
integer stored in the device, that is, R. Let the input be an integer 
R', ^ R' ^ R. When the device is activated, the number stored in 
the device becomes R — R' and, thus, the output also takes the value 
R - R'. 



INPUT 



OUTPUT 



ACTIVATE 

Fig. 5— Subtraction and storage device. 



The encoder operates as shown in Fig. 6. The storage devices are 
preset so that R< = a, (n - i + 1), 1 ^ i ^ n - d m , and T = a, the 
integer representation of the A sequence to be encoded.* Position the 



* For the (9, 7) code, the preset values are Ri = 93, R 2 = 29 and Ra = 7. 
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Fig. 6 — Encoder for arbitrary n and k. 
Preset Values: T = a 

Ri = ai(n) 

Rz = a 2 (n — 1) 



Rn-d m = Ot n -d m (dm + 1) 



switch so R = Ri . If T < Ri , the n — d m storage devices are activated 
and their contents reduced. Also, a is transmitted, that is, 6„ = 0. 
However, if T ^ i2 t , a 1 is sent, Ri is subtracted from T, and the 
switch is shifted so R = R 2 . The process continues until the entire B 
sequence is generated. Notice that the B sequence is generated and, 
thus, transmitted with b n first and b x last. 

The decoder is shown in Fig. 7. The storage devices are preset so that 
R { = a, (n — i + 1), 1 ^ i ^ n — d m , and the accumulator is set 
equal to zero. Position the switch so R = R x . Prior to the decoder, if 
the weight of the received n-tuple is greater than n — d m because errors 
have occurred, the weight is reduced ton — d m by arbitrarily converting 
sufficient ones to zeros. Thus, we assume that w(B') ^ n — d m and 
that B' arrives with b'„ first and b[ last. The decoder first considers b' n . 
If b' n = 0, the n — d m storage devices are activated and their contents 
reduced. If b' n = 1, the accumulator is increased by R x and the switch 
is shifted so R = R 2 . The process continues until b[ has been used. 
After b[ has been used, the accumulator contains the integer representa- 
tion of A'. 

For the stuff word detector, S and all other received words less than a 
specified distance from 8 are deleted. Thus, the detector counts the 
zeros in each word and, if the count is less than the specified distance, 
deletes the word. 
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Fig. 7 — Decoder for arbitrary n and k. 
Preset Values: Ri = a\{n) 

Ri = aiin — 1) 



Rn-d m = «„-*„.(<*„ + 1) 

Accumulator = 



V. ENCODING, DECODING, AND STUFF WORD DETECTION \ 71 = k + 1 

In this section, we give a possible implementation for the encoder, 
decoder, and stuff word detector for n = k + 1. 

5.1 Case 1 

Let the stuff word S be the all-zero (k + l)-tuple. The encoder is 
specified in (6) and shown in Fig. 8. In Fig. 8, the A sequence is assumed 
to arrive with a k first and <z x last. Similarly, b k+1 is transmitted first 
with &! last. 



If w{A) ^ 



If w(A) ^ 



+ 1, b k+l = 0. 



b* + i = I- 



(6) 



Then &,• = 6 t+1 © a, for 1 ^ i ^ k. 



Decoding is also straightforward. The operation necessary for de- 
coding is 



a'i = b' k+1 b'< for 1 ^ i ^ k. 



(7) 



Figure 9 shows the decoder in equation (7). In Fig. 9, it is assumed 
that 6£ +1 is received first and b[ is received last. The stuff word detector 
is the same as in Section IV except ones are counted instead of zeros. 






A=(a„a 2> ...,a k ) 

i 

i 

n 4 . 
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(a 1 ,a 2 ,...,a k ) (b,,b 2 ,...,b k ) (b„b 2 ,..., b k+l )=B 



DELAY 



Y Je 




b* + ,-o 



b k+l =i - 



bk- 



Fig. 8 — Encoder for Case 1. 

The encoding-decoding technique in (6) results in some error multipli- 
cation. Suppose that position i, 1 ^ i ^ k, is in error and that position 
k + 1 is correct. Then, from equations (6) and (7), 

a' t = b h+1 © 1 © hi = 1 © a, . 

The error in position i is delivered to the customer. However, if position 
k -f 1 is in error, from equations (6) and (7), 

< = 1 © b k+l © 6,- = 1 © a,- for 1 £ i g k. 

The point is that now all k data positions are in error. However, charac- 
ter synchronization is maintained because the correct number of bits 
are delivered to the destination. 

As noted in Section III, it is possible to design a code for which the 
stuff word is the all-zero word and then, for transmission, suppress the 
all-zero word by adding an unused word to each word before trans- 



B'=(b;,b 2 b k+l ) (b;.b 2 b' k : 



(a',,a 2 ,...,a'iO=A' 



r 
i 

b'k+i 



Fig. 9— Decoder for Case 1. 
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mission. However, it is easy to combine the encoding and addition 
operations. We give two examples. 

s.2 Case 2 

Let the stuff word be 

S = (1 ••• 0). 

| k positions 

position k + 1 

The encoder is given in (8). 



If w(A) ^ 



If w{A) £ 



+ 1, b k+1 = 1. 



b k+1 - 0. 



Then b t = 1 © b k+l © a,- for 1 ^ i ^ k. 
The decoder performs the operations in (9). 

a' { = 1 © b' k+1 © b{ for Utgfc. 



5.3 Case 3 

The stuff word is 



8 = (0 ••• 1 ••• 1). 

j kx positions 

position k + 1 



The encoder is specified in (10). 



If w(A) ^ 



If w(A) g 



+ 1, b k+l = 0. 



b k+1 = 1. 



Then 6, = 1 © b k+1 © a,- for 1 ^ t ^ fc, 

6, = b k+1 © a,- for fc, + 1 £ f £ Jb. 

The decoder performs the operations in (11). 



(8) 



(9) 



(10) 
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a\ = 1 © b' k+ i © &{ for lgigk lf 

a'i = b' k+1 © &{ for fc t + 1 £ » £ k. (11) 

Notice that if A; t = k, the decoder for Case 3 is identical to the decoder 
for Case 2. 

VI. THE (8, 7) CODE 

Because Bell System PCM channels use a basic 8-bit word, the (8, 7) 
code is of interest. From Fig. 3, it is possible to design an (8, 7) code 
with d m = 4. Let R, and R d denote the stuff rate and the rate of occur- 
rence of the data words that are distance four from S, respectively. 
When the stuff rate is low (R, < R d ), the decision rule at the receiver 
is biased in favor of the data words by deleting as stuff words all received 
words distance one from S and decoding as data words all remaining 
received words. Conversely, for high stuff rates (R. > R d ), the decision 
rule is biased in favor of the stuff words by deleting as stuff words all 
received words distance two or less from S and decoding as data words 
all remaining received words. When the rates are approximately equal 
(R. ~ R d ), the two decision rules give comparable performance. 

It is possible to modify the (8, 7) code by relaxing the minimum 
distance requirement so that all data words are merely required to be 
at least distance three from S. Received words distance one from S are 
deleted as stuff words; the remaining received words are decoded as 
data words. This balanced code gives roughly the same performance as 
either of the biased codes when R, ~ R d . However, the reduction in 
minimum distance provides for less error multiplication (see Section V). 

Let T d \, denote the mean time between erroneous conversions of a 
stuff word into a data word and let T.\ d denote the mean time between 
erroneous conversions of a data word into a stuff word. We use the 
following assumptions: 

(i) Transmission errors are independent of the transmitted bits, 
independent of each other, and occur with probability p = 10~ 7 . 

(ii) The transmission rate is 64 X 10 3 bits per second or, because 
n = 8, 8000 words per second. 

(Hi) All 7-bit input words are equally likely. 
(iv) The stuff rate is R, stuff words per second.* 

* The value of R, will vary depending upon the application. For fine adjust- 
ment of clock rates, R. typically would be less than 20 words per second. If word 
stuffing is used for speed padding as well as adjusting clock rates (for example, 
to send 50 kilobit service over a 64-kilobit line), R, could be 850 words per 
second or larger. 
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The balanced (8, 7) code is applicable except for extremely high or low 
stuff rates. Let N 3 denote the number of data words distance three from 
S (the exact value of N 3 depends upon the encoder). Then 



Tu.& 



9.93 X 10 



3600#. 
and 



*vi B - 



hours 



f - = ■ rae i = ^!(8 8 ooo- ^.) houre - 

3600(8000 -R,)V^r-p 

By a modification of the encoder-decoder in Case 1 of Section V, it is 
possible to reduce the error multiplication in the decoder. The stuff 
word is 

S = (00000000). 

The encoder is given in (12). 

If w(A) ^3, 6 8 = 0. 

If w(A) £2, 6.-1. (12) 

Then 6,- = 6 8 © a,- for 1 £ tf £ 4, 
b, = a,- for 5 ^ t* ^ 7. 

The decoder performs the operations in (13). 

a j = b' 8 6J for 1 ^ i ^ 4, 

a< = 6< for 5 ^ * ^ 7. (13) 

Notice that an error in position eight now results in four rather than 
seven errors for the customer. 

VII. COMPARISON WITH GROUP CODES 

In the binary group codes normally encountered in error-control 
work, the code words are a set of 2* n-tuples selected so that the code 
words form a group under component-by-component modulo two addi- 
tion. Because of the resulting perfect distance symmetries between the 
code words,* the probability that a transmitted word is decoded in 

* Let X and Y be code words. For any e(l ^ « g n), the number of code words 
distance e from X \s equal to the number of code words distance « from Y for all 
Zand Y. 
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error does not depend upon the transmitted word (provided the trans- 
mission errors are independent of the transmitted bits) . 

The code presented herein has 2* -f- 1 code words (the 2* elements of 
D plus the stuff word S). The code words do not form a group and ex- 
hibit highly asymmetric distance properties. It is the asymmetric dis- 
tance properties that enable us to use the available redundancy to 
protect against the loss of character synchronization due to transmission 
errors. 

We note that it is possible to design other asymmetric codes that 
are, in a sense, generalizations of the code in (2). Instead of a single 
stuff word, there are now several special words with unique distance 
properties with respect to each other and the set D. Such codes might 
be used in a data transmission system where, for example, one wishes 
to provide more protection for control characters than for data words. 
A wide range of capabilities is possible and future work in the design 
of these codes should prove profitable. 

VIII. CONCLUSIONS 

For a given redundancy, we give the code that maximizes the mini- 
mum distance between the stuff word and any data word. An encoder and 
decoder are given for arbitrary n and k. For n = k + 1, a particularly 
simple encoding-decoding technique is described. Certain properties of 
the (8, 7) code are considered in detail. 
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APPENDIX A 

Derivation of d m jor n = k + 1 

Let n = k + 1. From (2b), d m is chosen so that 



However, 



gftv-Kset 1 )- <"> 

2* + > -2=1: 



k + 1 



Therefore, for k even, 
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*— gft 1 ) 

which, from (14), implies that 

and, from (2c), that 5 = 0. For k odd, 



*-i-"f Pt'M 



/C + 1 
+ 1 



(15) 



where [k/2] denotes the largest integer g fc/2. Thus, from (14) and (15), 



d m = 



+ 1. 



APPENDIX B 

Encoder for (9, 7) Code 

For the (9, 7) code, d m = 6. Thus, construct the three arrays shown 
in Table II. Consider encoding a = 22. In the first array, ai(5) = 15 is 
the largest ai(j) not greater than 22. Therefore, ;\ = 5. Next, in the 
second array, we find that the largest a 2 (j) not greater than 

a - a x (5) = 22 - 15 - 7 

is a 2 (4) = 7. Thus, j 2 = 7. However, 

E ajj m ) = ai (5) + a 2 (4) = 22 = a 

m-l 

Table II — Arrays for Encoding for the (9, 7) Code. 



Array 1 




Array 


2 


Array 3 


j OCl(j) 


j 




CCiij) 


3 


aiCfl 


1 1 


1 




1 


1 


1 


2 2 


2 




2 


2 


2 


3 4 


3 




4 


3 


3 


4 8 


4 




7 


4 


4 


5 15 


5 




11 


5 


5 


6 26 


6 




16 


6 


6 


7 42 


7 




22 


7 


7 


8 64 


8 




29 






9 93 
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so the process terminates. Therefore, B has ones in positions j s = 5 
and j 7 = 4, that is, 

B = (000011000). 

TT 
position 5 — ■ ■ — position 4 

APPENDIX C 

Proof o) Equation (5) 
After substituting (4) into (5) , we must show that 

Min(n-d m -t + l.f-l) /• _ j\ Min(n-dm-i.l-l) /• _ -. 



E V + E 



£ (L) 



(16) 



for 1 & j £ n — i,l ^ * ^ n — d m — 1. Choose an i and consider ; as 
j increases from 1 ton - i. Suppose that 1 ^ j ^ n — d m — % + 1. 
Then (16) reduces to 

,£o \ m / £r \ m / £r \ra. 

or 

2'" 1 + 2' -1 = 2''. 

Now, suppose that w — d B — t + 1 < j fa n — t. Then (16) becomes 
„_ d +1 , _ | 



m-o x m 
However, 



l )+ s r~ = s ( J j- 



n-d.-I+l / \ 

- s (:)• 

The argument is valid for each i, 1 ^ t ^ n — d m — 1. 
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